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ABSTRACT 

Fluorescent monitoring of DNA amplification is the 
basis of real-time PCR, from which target DNA con- 
centration can be determined from the fractional 
cycle at which a threshold amount of amplicon DNA 
is produced. Absolute quantification can be 
achieved using a standard curve constructed by 
amplifying known amounts of target DNA. In this 
study, the mathematics of quantitative PCR are 
examined in detail, from which several fundamental 
aspects of the threshold method and the application 
of standard curves are illustrated. The construction 
of five replicate standard curves for two pairs of 
nested primers was used to examine the repro- 
ducibility and degree of quantitative variation 
using SYBER® Green I fluorescence. Based upon 
this analysis the application of a single, well- 
constructed standard curve could provide an 
estimated precision of ±6-21%, depending on the 
number of cycles required to reach threshold. A 
simplified method for absolute quantification is also 
proposed, in which quantitative scale is determined 
by DNA mass at threshold. 



INTRODUCTION 

Kinetic PCR (kPCR) allows quantification of a target DNA 
within a sample, with the advantage that sensitivity is 
independent of copy number (1-4). The key aspect differen- 
tiating kPCR from previous quantitative PCR methodologies 
is that target copy number is determined from the fractional 
cycle at which a threshold amount of amplicon DNA is 
reached (threshold cycle or Q), set at a point where amplicon 
DNA just becomes detectable, but is still within the expo- 
nential phase of the amplification (5-7). This approach 
ensures that interfering factors associated with the late stages 
of amplification are minimized, and provides the potential for 
unprecedented precision for quantitative determinations. 

Although several methods have been developed to measure 
Q, all are based upon fluorescent monitoring of amplicon 
DNA generation (8-10). Absolute quantification can be 
achieved using a standard curve, constructed by amplifying 
known amounts of target DNA in a parallel group of reactions 



run under identical conditions to that of the sample (7,11). 
Standard curve preparation is both labour intensive and error 
prone, with quantitative accuracy being dependent on both the 
accuracy of DNA standard quantification and the quality of 
standard curve construction (1,12,13). 

In this study, a detailed examination of the mathematics 
governing PCR yielded insights into the process of quantita- 
tive kPCR and into some of the fundamental aspects of the 
threshold method. This provided a foundation from which to 
examine the reproducibility of standard curve construction. 
Assessment of intra- and inter-run variation indicates that a 
substantial degree of precision can be achieved, even with the 
application of a single standard curve to multiple runs. An 
alternative approach is also proposed in which quantitative 
scale is determined by DNA mass at threshold, such that 
absolute quantification would only require determination of 
amplification efficiency. 

MATERIALS AND METHODS 

The DNA standard consisted of a 218 bp amplicon produced 
by the K3/K2 primer pair (forward K3: GGCACCTC- 
AGGAATGGGCTATTACAA and reverse K2: AGAATA- 
ACACAGAAATCTGTAGGTGGAATTGAA) that was puri- 
fied by chloroform extraction followed by isopropanol 
precipitation, and quantified by averaging three replicate 
A 2 6o absorbance determinations conducted on two spectro- 
photometers. A second 102 bp amplicon was produced by 
pairing of K2 with another primer (forward Kl: 
TCCTATGAGATTATGACGCATTTCTCCAAA) located 
near the center of the K3/K2 amplicon. The primer pair 
combinations of K3/K2 and the nested K1/K2 thus allowed the 
production of two different-sized amplicons (218 and 102 bp, 
respectively) using the same DNA standard dilution series. 

PCR amplifications were conducted using QuantiTect™ 
Syber® Green PCR Kit (Qiagen Inc.) according to the 
manufacturer's instructions, with 0.25 U.M primers and a 
variable amount of DNA standard in a 35 ul final reaction 
volume. Thermocycling was conducted using an Opticon2 
DNA Engine (MJ Research Inc.) initiated by a 15 min 
incubation at 94°C, followed by 45 cycles (90°C, 1 s; 62°C, 
120 s) with a single fluorescent reading taken at the end of 
each cycle. Each run was completed with a melting curve 
analysis to confirm the specificity of amplification and lack of 
primer dimers. Q values were determined by the Opticon2 
software using a fluorescence threshold manually set to 0.0160 
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for all runs and exported into a MS Excel workbook 
(Microsoft Inc.) for analysis (available as Supplementary 
Material). 



RESULTS 

Mathematics of quantitative kPCR 

The basic equation describing PCR amplification is: 

N c = N 0 -(E + l) c 1 

where C is the number of thermocycles, E is amplification 
efficiency (also expressed as %E = E X100%), N c is the 
number of amplicon molecules and N 0 is the initial number of 
target molecules. 

In simple terms, each thermocycle produces an increase in 
N c in proportion to amplification efficiency, such that 100% 
efficiency produces a doubling in the number of amplicon 
molecules. Additionally, the quantity of N c present after any 
specific number of thermocycles is dependent on N 0 . 
Rearrangement of equation 1 provides the mathematical 
relationship upon which quantitative kPCR is based: 

N 0 = N C /(E + l) c 2 

Quantification of N c thus allows N 0 to be calculated if 
amplification efficiency is known. A major breakthrough for 
quantitative PCR came with the use of DNA fluorescence to 
monitor amplicon accumulation (3,5). Based upon this 
technique Higuchi et al. (5) developed an elegant method 
that simplifies N c determination, such that individual ampli- 
fication reactions are compared at the point at which they 
contain identical amounts of amplicon DNA. This is accom- 
plished by selecting a fluorescent threshold (F t ) from which 
the fractional thermocycle (Q) is calculated that defines the 
theoretical point at which each amplification reaction reaches 
fluorescence threshold. 

Under this 'threshold' method, N c becomes a constant such 
that equation 2 becomes: 

N 0 = N t /(E + 1) Q 3 

where C t is the threshold cycle and N t is the number of 
amplicon molecules at fluorescent threshold. 

Absolute quantification can be achieved using a standard 
curve constructed by amplification of known amounts of target 
DNA and plotting the resulting Q values against target DNA 
concentration. The mathematical basis of a standard curve can 
be derived by taking the logarithm of equation 3: 

Log(N 0 ) = Log(N t ) - Log[(E + 1) Q ] 
Log(N„) = Log(\)-Log(I£+ D-C, 

Log(N 0 ) = -Log(E + 1>C, + Log (N t ) 4 

Assuming E and N t are constants, equation 4 has the general 
structure of a line (y = mx + b) such that plotting Log(N 0 ) 
versus Q produces a line with: 

Slope = -Log(E + 1) 

E s = 10- sl °P e - 1 5 



and 

Intercept = Log(N t ) 

N t = lO 1 "' 6 ^ 1 6 

where E s is the slope-derived estimate of amplification 
efficiency. 

Although the ability to derive amplification efficiency from 
the slope of a standard curve has been widely reported, it has 
not been generally recognized that the number of amplicon 
molecules at threshold can be directly determined from the 
intercept. It must also be stressed that these derivations are 
valid only if all PCR reactions have identical amplification 
efficiencies, and only if amplification efficiency is invariant 
over the number of thermocycles required to reach Q. 

Another important but often overlooked aspect of the 
threshold method is the interdependency of Q and N t on F t , 
which has two important implications. First, C t values 
generated from different amplification runs can be directly 
compared only if an identical F t is used for each run. Second, 
the relationship between N, and F t is dependent on amplicon 
size. This is due to the fact that the underlying determinant of 
F, is DNA fluorescence, which in turn has a linear relationship 
with DNA mass. As such F t directly reflects DNA mass at 
threshold, which is related to N, as described by: 

M t = (N t -A s )/9.1 X 10 11 7 

where M, is the DNA mass at threshold in nanograms, A s is 
the amplicon size in base pairs and 9. 1 X 10 11 is the number of 
single base pair molecules per nanogram. 

A less obvious but potentially significant extension of this is 
that if M, is known, N, can be predicted for any amplicon of 
known size, if it is assumed that amplicon size and base pair 
composition do not significantly influence DNA fluorescence. 
To test the general utility of PCR mathematics for standard 
curve evaluation and to examine the effectiveness of M t for 
predicting N t , a series of replicate standard curves was 
constructed for two amplicons that differ significantly in size. 

Experimental design for constructing replicate standard 

Figure 1 is an example of the two types of graphic output 
generated by the instrument used in this study, and illustrates 
the two basic steps in quantitative kPCR using the threshold 
method, i.e. the selection of a fluorescent threshold from 
which Q values are generated (Fig. 1A), followed by linear 
regression analysis of a Log(N 0 ) versus Q plot, from which E s 
and N t are estimated (Fig. IB). 

The major consideration for F t selection is that it falls within 
the exponential phase of the amplification reaction, best 
illustrated by plotting log fluorescence versus cycle number 
(Fig. 1A). As long as F t is within this log-linear region, the 
absolute value of F t was found to have only a modest impact 
on the slope-derived estimate of amplification efficiency (data 
not shown). However, as outlined above, F t does have a direct 
impact on both Q and N t such that F t must be fixed if data from 
multiple runs are to be directly compared. 

To evaluate the reproducibility and quantitative variation of 
the threshold method, five replicate standard curves were 
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Figure 1. Output of a typical amplification run used for construction of the standard curves in this study. Four replicate amplification reactions were 
conducted lor each concentration of DNA standard, covering six magnitudes of target DNA concentration. (A) Plot of the log of reaction fluorescent versus 
tliciniocu Ic ili. H pro\idcs a j'.raphic representation for each amplification reaction, in which the linear region represents the exponential phase of the 
amplification. Fluorescent threshold provides the reference point from which the threshold cycle (C,) is calculated. (B) The resulting standard curve generated 
In plotting the log of target DNA concentration versus threshold cycle. Linear regression analysis is used to determine the slope and intercept, which 
correspond. :v-|\\ n\eK . to the ati:plilicatio:i ellicie ic; . id the lumber of amplicon molecules at threshold, as described by equation 4. 



generated from two pairs of nested primers (K3/K2 and 
K1/K2, see Materials and Methods for additional details) 
using a DNA standard dilution series covering six magnitudes 
of target DNA concentration. The use of nested primers 
allowed two different-sized amplicons (218 and 102 bp. 
respectively) to be amplified side-by-side within the same run, 
using the same DNA standard dilution series. Intra- and inter- 
run variation could then be examined for each of the two 
amplicons, free of errors caused by variations in the DNA 
standard. Using an identical F t for all runs, the average Q of 
four replicate amplifications for each DNA concentration were 
used in the analysis (Table 1). A spreadsheet containing the 
individual Q values and the calculations used for their 
analysis is provided as Supplementary Material. 

Intra- and inter-run variation in C t 

As an initial step for evaluation of quantitative precision, the 
reproducibility of amplification under our experimental con- 
ditions was estimated, based upon the standard deviation in Q 
values generated from replicate amplifications. Moreover, due 
to the exponential scale of Q, the impact of its variation can be 
difficult to assess, and thus the standard deviations in Q were 
also used to estimate the variation in percent molecules based 
upon the equation: 

±%Molecules = [(E + 1) SD - 1] X 100% 8 

where SD is the standard deviation in Q generated from 
replicate amplifications. 



Overall the standard deviation in C, of replicate amplifica- 
tions ranged from 0.036 to 0.367 cycles, with an average 
of 0.183 cycles (Table 1 and Supplementary Material). This 
corresponds to an estimated variation in molecules that ranges 
from ±2.3 to ±26.6% with an average of ±12.4%, using an 
amplification efficiency of 90% taken from the slope-based 
estimate of amplification efficiency determined below. 

Based upon the average standard deviation produced from 
each individual run, estimates of the intra-mn variation were 
similar for both amplicons, ranging from ±9.6% to 14.9% of 
molecules (runs 1-5, Table 1). When combined with inter-run 
variation, this increased to ±17.4 and ±21.3% of molecules 
for each amplicon, respectively, based upon averaging the 
standard deviation in C t for each DNA concentration from all 
runs ('Combined', Table 1). These variations, although 
significant, indicate that Q values have an acceptable level 
of reproduciblity over the six magnitudes of target DNA 
concentration that were examined. 

Standard curve construction and evaluation 

Evaluation of the quantitative variation between replicate 
standard curves was conducted by generating N t and %E S 
values for each amplification run listed in Table 1 . This was 
done by exporting the Q values into a spreadsheet, and 
calculating the slope and intercept for each run using linear 
regression analysis of log(N 0 ) versus Q. Two methods were 
then used to assess the quantitative variation between the five 
replicate standard curves constructed for each of the two 
amplicons (Table 2). 
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Run 2 



Run 4 



Run 5 



Combined 



9.470 ±0.112 
13.503 ± 0.338 
17.422 ±0.119 



10.178 ± 0.178 
13.519 ± 0.214 
17.257 ± 0.206 

24.150 ± 0.036 
27.660 ± 0.076 
±0.143 
±9.6 



9.907 ± 0. 
13.535 
16.972 ± 0. 
20.863 ± 0. 



9.892 ± 0.136 
13.556 ± 0.195 
17.105 ± 0.280 
20.815 ± 0.250 
24.247 ± 0.147 
27.884 ± 0.264 
±0.212 



10.006 ± 0.161 
13.573 ± 0.347 
17.346 ± 0.093 
21.302 ± 0.156 
24.353 ± 0.256 
28.190 ± 0.226 
±0.207 



9.893 ± 0.279 
13.537 ± 0.232 
17.220 ± 0.234 
21.045 ± 0.284 
24.172 ± 0.196 
27.851 ± 0.275 
±0.250 
±17.4 



N 0 Run 1 Run 2 Run 3 Run 4 Run 5 Combined 

4.17 X 10 7 8.602 ± 0.235 8.491 ± 0.140 8.444 ± 0.099 8.752 ± 0.127 8.841 ± 0.309 8.626 ± 0.235 

4.17 X 10 6 12.349 ± 0.141 11.979 ± 0.137 12.291 ± 0.083 12.539 + 0.367 12.312 ± 0.258 12.294 ± 0.271 

4.17 X 10 5 15.794 ± 0.138 16.049 ± 0.155 16.089 ± 0.227 16.106 ± 0.194 15.644 ± 0.177 15.936 + 0.248 

4.17 X 10 4 19.701 ± 0.171 19.549 ± 0.209 19.828 ± 0.261 19.715 ± 0.319 19.203 ± 0.184 19.599 ± 0.305 

4.17 X 10 3 22.574 ± 0.184 22.904 ± 0.182 22.920 ± 0.269 23.311 ± 0.163 23.388 ± 0.151 23.019 ± 0.351 

4.17 X 10 2 26.114 ± 0.113 26.447 ± 0.065 26.436 ± 0.215 27.039 ± 0.122 27.006 ± 0.223 26.608 ± 0.395 

Average SD a ±0.164 ±0.148 ±0.192 ±0.215 ±0.217 ±0.301 

%Mol b ±11.1 ±10.0 ±13.0 ±14.8 ±14.9 ±21.3 

Average C t values (runs 1-5, ± standard deviation, n = 4) for 
concentration from all runs (combined, ± standard deviation, t 
corresponding percentage of molecules. 
"Average of the standard deviations from each data set. 

b Average standard deviation converted into percent molecules using equation 8. 



Examination of the absolute values of N, and %E S revealed 
similar trends for both amplicons, with an inter-curve 
variation in %E S of ±2.2 and ±2.1%, and variation in N, of 
± 19.0 and ± 14.7%, respectively, as based upon their standard 
deviations (Table 2). Taken individually, the magnitude of 
variation in %E S and N t suggests the resulting variation in N 0 
determination could be large. For example, for a Q of 25 
cycles, a ±2.2% variance in the estimate of amplification 
efficiency would produce an approximate ±33% variation in 
N 0 that, when combined with the apparent ± 19% variance in 
N t , could produce an overall variation of about ±52% for N 0 . 
It must be noted, however, that further examination suggests 
that these estimates of variance are most certainly erroneous, 
due to an apparent intra-curve correlation between slope and 
intercept. 

Comparing the %E S and N t values generated from each 
individual standard curve reveals that for both amplicons, the 
curve that produced the highest %E S also produced the highest 
N t (Table 2, K1/K2, run 2 and K3/K2, run 1). Similarly, the 
standard curves producing the lowest %E S also had the lowest 
N t (Table 2, K1/K2, run 1 and K3/K2, run 5). Taken together, 
these trends suggest that variations in intercept and slope are 
not solely caused by inter-run variation in instrumentation 
and/or amplification, but also reflect an innate characteristic of 
linear regression in which variations in slope can be compen- 
sated for to some degree by a corresponding variation in 
intercept. 

This can be best illustrated through an alternative approach 
to evaluating quantitative differences produced by replicate 
standard curves. As illustrated in Table 2, inter-curve variation 
can be estimated by comparing the calculated N 0 for a series of 
simulated Q values using equation 3. Thus, for the five 
standard curves constructed from the K1/K2 amplicon, the 



calculated N 0 for C,=10 cycles ranges from 3.67 X 10 7 to 4.64 
X 10 7 molecules, with an average of 4. 1 3 X 1 0 7 molecules and 
a standard deviation corresponding to ±8.7% of molecules 
(Table 2). Furthermore, a general increase in variation is 
observed with increasing C, such that for C t = 30 cycles, a 
variation of ±18.1% of molecules is produced. Very similar 
results were produced by the larger K3/K2 amplicon (Table 2). 

Overall, this analysis demonstrates that quantitative vari- 
ations produced by replicate standard curves can be relatively 
small, ranging in this suidy from a low of about ±6% to a high 
of about ±21% depending on the number of cycles needed to 
reach threshold. The observed inter-curve variation in the 
absolute values of slope and intercept also suggests that curve- 
based estimates of amplification efficiency and N t require a 
larger data set than would normally be used for construction of 
a single standard curve. Indeed, the relative accuracy of the N t 
estimates for each of the two amplicons can be tested through 
the correlation of their respective M t values, as described by 
equation 7. Based upon the N t values derived from each 
respective 'combined' data set, the estimated M t values differ 
by 7.3% (Table 2). This provides support for both the optical 
precision of the instrumentation and similarity in the SYBER® 
Green 1 fluorescent characteristics of these two amplicons. 



DISCUSSION 

Despite the extensive use of the threshold method for absolute 
quantification, there exists a paucity of studies that have 
examined the utility of the underlying mathematics. 
Furthermore, the general simplicity and widespread use of 
standard curves has led to the automation of quantitative 
determinations, which can obscure the mathematical prin- 
ciples upon which the analysis is based. Familiarity with the 
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Table 2. Summ i I l! Ii regi ionanaksi c ulucl I lor each amplification run listed in Table 1 and an evaluation of inter-curve variation 







%E S 


N, 


Run 1 


0.9971 


87.4% 


1.96 X 10 10 


Run 2 


0.9997 


92.6^ 


3.27 X 10 10 


Run 3 


0.9995 


91.7% 


2.74 X 10 10 


Run 4 


0.9999 


89.8% 


2.43 X 10 10 


Run 5 


0.9992 


88.3 r ( 


2.39 X 10 10 




%CV d 


+ii % 


+ 19.0% 




Average 


90.0% 


2.56 X 10 10 


Combined 


0.9994 


90.0Q 


2.52 X 10 10 






2.83 ng 





±12.8% ±18.1% 



Run 2 
Run 3 
Run 4 



0.99X7 

0.9992 
0.9987 
0.9999 
0.9987 
%CV d 

0.9999 



2.63 ng 



The spreadsheet used for diis analysis is available as Supplementary Material. 
"Linear-regression analysis of the data sets from Table 1 based upon equations 5 and 6. 
b N 0 calculated using equation 3, for the specified C t values using the E s and N, from each n 
°Pearson correlation coefficient. 
d (SD/Average) X100%. 

e DNA mass at threshold calculaicil for the 'combined' N t using equation 7. 



fundamentals of PCR mathematics cannot only yield impor- 
tant insights, but as well provide a foundation from which to 
address some of the major limitations of quantitative kPCR. 

At the most basic level, the threshold method does not 
generally provide an effective indication of quantitative 
precision or accuracy. Although the standard deviation in Q 
produced from replicate amplifications can provide an 
estimate of reproducibility, there is a general deficiency in 
reporting the errors associated with standard curve construc- 
tion. This makes it difficult to evaluate the effectiveness of any 
specific quantitative determination, or of comparing results 
produced by different studies. As was demonstrated in this 
study, a basic assessment of standard curve construction can 
be conducted, if it is understood that slope and intercept are 
directly correlated to amplification efficiency and the number 
of amplicon molecules at threshold (N t ), respectively. 

In this study, comparison of replicate standard curves 
revealed potentially large inter-curve variations, based upon 
the absolute values of slope and intercept. This initially led to 
the conclusion that this was caused by substantial inter-run 
variations in amplification and/or instrumentation. However, 
upon closer examination, an intra-curve correlation between 
slope and intercept became apparent, such that differences in 
slope are compensated for to a significant degree by 
corresponding differences in intercept. 

This can be demonstrated through simple mathematical 
modeling, in which the initial number of target molecules (N 0 ) 
is calculated for a series of simulated C t values (equation 3, 
Table 2). This showed that despite the differences in the 



absolute values of amplification efficiency and N,, the 
resulting N 0 values generated by each standard curve were 
unexpectedly similar. Based upon this analysis the application 
of a single, well-constructed standard curve could provide an 
estimated precision of ±6-21% of molecules, depending on 
the number of cycles required to reach threshold. 

Notwithstanding the interrelationship of slope and inter- 
cept, it must be stressed that the mathematics of PCR dictate 
that amplification efficiency and N t are independent entities. 
In reality N, is determined solely by the fluorescent threshold 
(F,), and as such its value is independent of the parameters 
impacting PCR amplification. Indeed, this interrelationship 
between N t and F, has important practical implications, based 
on the principle that F t does not directly reflect the number of 
amplicon molecules, but rather DNA mass at fluorescent 
threshold (M t ). This in turn dictates that M t could be used to 
predict N, for any amplicon of known size, if it is assumed that 
amplicon size and base composition do not significantly 
impact DNA fluorescence. Support for the validity of this 
assumption was provided by the N t estimates generated from 
the two amplicons used in this study, for which the predicted 
M, values differ by 7.3% (equation 7, Table 2). 

The practical significance of this becomes apparent if it is 
noted that N t is the sole determinant of scale (equation 3), the 
accuracy of which is dependent on the quantitative accuracy of 
the DNA standard used for standard curve construction. If, 
however, M t can be used to predict N t with sufficient 
precision, a common quantitative scale could be applied to 
all amplicons. In addition to circumventing the necessity of 
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preparing a quantified DNA standard for each individual 
amplicon, the major source of variation in quantitative scale 
would become the optical precision of the instrument. Equally 
significant is that absolute quantification would be simplified, 
requiring only determination of amplification efficiency once 
M t has been established. 

SUPPLEMENTARY MATERIAL 

Supplementary Material is available at NAR Online. 
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